Dataset statistics
| Number of variables | 30 |
|---|---|
| Number of observations | 40031 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.5 MiB |
| Average record size in memory | 248.0 B |
Variable types
| Numeric | 13 |
|---|---|
| DateTime | 2 |
| Categorical | 13 |
| Unsupported | 2 |
year_number has constant value "2019" | Constant |
Product_SKU has a high cardinality: 1113 distinct values | High cardinality |
Product_Description has a high cardinality: 403 distinct values | High cardinality |
Transaction_ID is highly correlated with week_number and 1 other fields | High correlation |
week_number is highly correlated with Transaction_ID and 1 other fields | High correlation |
month_number is highly correlated with Transaction_ID and 1 other fields | High correlation |
Location is highly correlated with year_number | High correlation |
Product_Category is highly correlated with year_number and 1 other fields | High correlation |
year_number is highly correlated with Location and 9 other fields | High correlation |
Coupon_Code is highly correlated with year_number and 2 other fields | High correlation |
User_type is highly correlated with year_number | High correlation |
GST is highly correlated with Product_Category and 2 other fields | High correlation |
Coupon_Status is highly correlated with year_number | High correlation |
Discount_pct is highly correlated with year_number and 1 other fields | High correlation |
Gender is highly correlated with year_number | High correlation |
Visit_days_average is highly correlated with year_number | High correlation |
revenue_seg is highly correlated with year_number | High correlation |
Quantity is highly skewed (γ1 = 20.23009436) | Skewed |
revenue is highly skewed (γ1 = 42.29610585) | Skewed |
Transaction_Date_Month_x is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Transaction_Date_Month_y is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2022-01-08 19:31:27.458280 |
|---|---|
| Analysis finished | 2022-01-08 19:31:58.988886 |
| Duration | 31.53 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
CustomerID
Real number (ℝ≥0)
| Distinct | 734 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15317.42497 |
|---|---|
| Minimum | 12347 |
| Maximum | 18283 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 12347 |
|---|---|
| 5-th percentile | 12625 |
| Q1 | 13815 |
| median | 15281 |
| Q3 | 16946.5 |
| 95-th percentile | 17961 |
| Maximum | 18283 |
| Range | 5936 |
| Interquartile range (IQR) | 3131.5 |
Descriptive statistics
| Standard deviation | 1767.723598 |
|---|---|
| Coefficient of variation (CV) | 0.1154060556 |
| Kurtosis | -1.229290173 |
| Mean | 15317.42497 |
| Median Absolute Deviation (MAD) | 1577 |
| Skewness | -0.007157434299 |
| Sum | 613171839 |
| Variance | 3124846.719 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 12748 | 695 | 1.7% |
| 15311 | 587 | 1.5% |
| 14606 | 575 | 1.4% |
| 17841 | 572 | 1.4% |
| 14911 | 523 | 1.3% |
| 13089 | 366 | 0.9% |
| 15039 | 315 | 0.8% |
| 17850 | 297 | 0.7% |
| 14646 | 290 | 0.7% |
| 13081 | 261 | 0.7% |
| Other values (724) | 35550 |
| Value | Count | Frequency (%) |
| 12347 | 60 | |
| 12348 | 23 | 0.1% |
| 12370 | 91 | |
| 12377 | 77 | |
| 12383 | 69 |
| Value | Count | Frequency (%) |
| 18283 | 102 | |
| 18269 | 8 | < 0.1% |
| 18260 | 40 | 0.1% |
| 18245 | 55 | |
| 18239 | 52 |
| Distinct | 19240 |
|---|---|
| Distinct (%) | 48.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32218.71512 |
|---|---|
| Minimum | 16679 |
| Maximum | 48468 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 16679 |
|---|---|
| 5-th percentile | 18778.5 |
| Q1 | 25232 |
| median | 32405 |
| Q3 | 38731.5 |
| 95-th percentile | 46379 |
| Maximum | 48468 |
| Range | 31789 |
| Interquartile range (IQR) | 13499.5 |
Descriptive statistics
| Standard deviation | 8516.175972 |
|---|---|
| Coefficient of variation (CV) | 0.2643238857 |
| Kurtosis | -0.996507855 |
| Mean | 32218.71512 |
| Median Absolute Deviation (MAD) | 6795 |
| Skewness | 0.04324610802 |
| Sum | 1289747385 |
| Variance | 72525253.18 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 34094 | 28 | 0.1% |
| 38059 | 27 | 0.1% |
| 24820 | 24 | 0.1% |
| 33228 | 23 | 0.1% |
| 39392 | 23 | 0.1% |
| 34189 | 22 | 0.1% |
| 36082 | 22 | 0.1% |
| 32526 | 22 | 0.1% |
| 33668 | 21 | 0.1% |
| 36871 | 21 | 0.1% |
| Other values (19230) | 39798 |
| Value | Count | Frequency (%) |
| 16679 | 1 | < 0.1% |
| 16680 | 1 | < 0.1% |
| 16681 | 1 | < 0.1% |
| 16682 | 10 | |
| 16684 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 48468 | 1 | |
| 48467 | 2 | |
| 48466 | 1 | |
| 48465 | 1 | |
| 48464 | 1 |
Transaction_Date
Date
| Distinct | 365 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Minimum | 2019-01-01 00:00:00 |
|---|---|
| Maximum | 2019-12-31 00:00:00 |
| Distinct | 1113 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| GGOENEBJ079499 | 2660 |
|---|---|
| GGOENEBQ078999 | 2579 |
| GGOENEBB078899 | 2482 |
| GGOENEBQ079099 | 1037 |
| GGOENEBQ079199 | 812 |
| Other values (1108) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 13.99980015 |
| Min length | 12 |
Characters and Unicode
| Total characters | 560426 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 99 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | GGOENEBJ079499 |
|---|---|
| 2nd row | GGOENEBJ079499 |
| 3rd row | GGOENEBQ078999 |
| 4th row | GGOENEBQ079099 |
| 5th row | GGOENEBJ079499 |
| Value | Count | Frequency (%) |
| GGOENEBJ079499 | 2660 | 6.6% |
| GGOENEBQ078999 | 2579 | 6.4% |
| GGOENEBB078899 | 2482 | 6.2% |
| GGOENEBQ079099 | 1037 | 2.6% |
| GGOENEBQ079199 | 812 | 2.0% |
| GGOENEBQ084699 | 809 | 2.0% |
| GGOENEBQ086799 | 614 | 1.5% |
| GGOEGFKQ020399 | 598 | 1.5% |
| GGOENEBQ086499 | 444 | 1.1% |
| GGOEGDHC018299 | 433 | 1.1% |
| Other values (1103) | 27563 |
| Value | Count | Frequency (%) |
| ggoenebj079499 | 2660 | 6.6% |
| ggoenebq078999 | 2579 | 6.4% |
| ggoenebb078899 | 2482 | 6.2% |
| ggoenebq079099 | 1037 | 2.6% |
| ggoenebq079199 | 812 | 2.0% |
| ggoenebq084699 | 809 | 2.0% |
| ggoenebq086799 | 614 | 1.5% |
| ggoegfkq020399 | 598 | 1.5% |
| ggoenebq086499 | 444 | 1.1% |
| ggoegdhc018299 | 433 | 1.1% |
| Other values (1103) | 27563 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 103945 | |
| 9 | 65110 | |
| E | 55623 | |
| 0 | 50244 | 9.0% |
| O | 43781 | 7.8% |
| 1 | 26606 | 4.7% |
| A | 24419 | 4.4% |
| B | 22933 | 4.1% |
| 7 | 18982 | 3.4% |
| 8 | 18190 | 3.2% |
| Other values (24) | 130593 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 320236 | |
| Decimal Number | 240190 |
Most frequent character per category
| Value | Count | Frequency (%) |
| G | 103945 | |
| E | 55623 | |
| O | 43781 | |
| A | 24419 | 7.6% |
| B | 22933 | 7.2% |
| N | 12674 | 4.0% |
| Q | 11990 | 3.7% |
| J | 8629 | 2.7% |
| H | 6110 | 1.9% |
| C | 5404 | 1.7% |
| Other values (14) | 24728 | 7.7% |
| Value | Count | Frequency (%) |
| 9 | 65110 | |
| 0 | 50244 | |
| 1 | 26606 | |
| 7 | 18982 | 7.9% |
| 8 | 18190 | 7.6% |
| 3 | 14847 | 6.2% |
| 4 | 13100 | 5.5% |
| 6 | 11313 | 4.7% |
| 2 | 11154 | 4.6% |
| 5 | 10644 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 320236 | |
| Common | 240190 |
Most frequent character per script
| Value | Count | Frequency (%) |
| G | 103945 | |
| E | 55623 | |
| O | 43781 | |
| A | 24419 | 7.6% |
| B | 22933 | 7.2% |
| N | 12674 | 4.0% |
| Q | 11990 | 3.7% |
| J | 8629 | 2.7% |
| H | 6110 | 1.9% |
| C | 5404 | 1.7% |
| Other values (14) | 24728 | 7.7% |
| Value | Count | Frequency (%) |
| 9 | 65110 | |
| 0 | 50244 | |
| 1 | 26606 | |
| 7 | 18982 | 7.9% |
| 8 | 18190 | 7.6% |
| 3 | 14847 | 6.2% |
| 4 | 13100 | 5.5% |
| 6 | 11313 | 4.7% |
| 2 | 11154 | 4.6% |
| 5 | 10644 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 560426 |
Most frequent character per block
| Value | Count | Frequency (%) |
| G | 103945 | |
| 9 | 65110 | |
| E | 55623 | |
| 0 | 50244 | 9.0% |
| O | 43781 | 7.8% |
| 1 | 26606 | 4.7% |
| A | 24419 | 4.4% |
| B | 22933 | 4.1% |
| 7 | 18982 | 3.4% |
| 8 | 18190 | 3.2% |
| Other values (24) | 130593 |
| Distinct | 403 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | 2660 |
|---|---|
| Nest Cam Outdoor Security Camera - USA | 2579 |
| Nest Cam Indoor Security Camera - USA | 2482 |
| Google Sunglasses | 1185 |
| Nest Protect Smoke + CO White Battery Alarm-USA | 1037 |
| Other values (398) |
Length
| Max length | 59 |
|---|---|
| Median length | 37 |
| Mean length | 34.23591716 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1370498 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel |
|---|---|
| 2nd row | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel |
| 3rd row | Nest Cam Outdoor Security Camera - USA |
| 4th row | Nest Protect Smoke + CO White Battery Alarm-USA |
| 5th row | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel |
| Value | Count | Frequency (%) |
| Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | 2660 | 6.6% |
| Nest Cam Outdoor Security Camera - USA | 2579 | 6.4% |
| Nest Cam Indoor Security Camera - USA | 2482 | 6.2% |
| Google Sunglasses | 1185 | 3.0% |
| Nest Protect Smoke + CO White Battery Alarm-USA | 1037 | 2.6% |
| Nest Protect Smoke + CO White Wired Alarm-USA | 812 | 2.0% |
| Nest Learning Thermostat 3rd Gen-USA - White | 809 | 2.0% |
| Google 22 oz Water Bottle | 679 | 1.7% |
| Nest Thermostat E - USA | 614 | 1.5% |
| Google Laptop and Cell Phone Stickers | 598 | 1.5% |
| Other values (393) | 26576 |
| Value | Count | Frequency (%) |
| 16442 | 7.2% | |
| 13328 | 5.8% | |
| nest | 12572 | 5.5% |
| tee | 8786 | 3.9% |
| men's | 7001 | 3.1% |
| usa | 6672 | 2.9% |
| sleeve | 6126 | 2.7% |
| cam | 5725 | 2.5% |
| short | 5548 | 2.4% |
| camera | 5169 | 2.3% |
| Other values (392) | 140633 |
Most occurring characters
| Value | Count | Frequency (%) |
| 188328 | 13.7% | |
| e | 172983 | 12.6% |
| o | 99814 | 7.3% |
| t | 81161 | 5.9% |
| a | 69444 | 5.1% |
| r | 68473 | 5.0% |
| l | 61658 | 4.5% |
| n | 51003 | 3.7% |
| S | 47393 | 3.5% |
| s | 44653 | 3.3% |
| Other values (64) | 485588 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 892197 | |
| Uppercase Letter | 241231 | 17.6% |
| Space Separator | 188328 | 13.7% |
| Dash Punctuation | 17510 | 1.3% |
| Decimal Number | 15067 | 1.1% |
| Other Punctuation | 13889 | 1.0% |
| Math Symbol | 1932 | 0.1% |
| Currency Symbol | 124 | < 0.1% |
| Open Punctuation | 110 | < 0.1% |
| Close Punctuation | 110 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 47393 | |
| G | 23249 | |
| C | 21458 | |
| T | 18833 | 7.8% |
| A | 18228 | 7.6% |
| N | 14732 | 6.1% |
| B | 13872 | 5.8% |
| U | 12575 | 5.2% |
| W | 10699 | 4.4% |
| M | 9491 | 3.9% |
| Other values (16) | 50701 |
| Value | Count | Frequency (%) |
| e | 172983 | |
| o | 99814 | |
| t | 81161 | |
| a | 69444 | 7.8% |
| r | 68473 | 7.7% |
| l | 61658 | 6.9% |
| n | 51003 | 5.7% |
| s | 44653 | 5.0% |
| i | 36408 | 4.1% |
| g | 31228 | 3.5% |
| Other values (16) | 175372 |
| Value | Count | Frequency (%) |
| 3 | 4080 | |
| 0 | 3154 | |
| 1 | 2665 | |
| 2 | 2592 | |
| 4 | 965 | 6.4% |
| 5 | 698 | 4.6% |
| 7 | 418 | 2.8% |
| 6 | 292 | 1.9% |
| 8 | 181 | 1.2% |
| 9 | 22 | 0.1% |
| Value | Count | Frequency (%) |
| ' | 10220 | |
| / | 1443 | 10.4% |
| % | 1337 | 9.6% |
| & | 652 | 4.7% |
| . | 124 | 0.9% |
| ; | 113 | 0.8% |
| Value | Count | Frequency (%) |
| 188328 |
| Value | Count | Frequency (%) |
| - | 17510 |
| Value | Count | Frequency (%) |
| + | 1932 |
| Value | Count | Frequency (%) |
| ( | 110 |
| Value | Count | Frequency (%) |
| ) | 110 |
| Value | Count | Frequency (%) |
| $ | 124 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1133428 | |
| Common | 237070 | 17.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 172983 | |
| o | 99814 | 8.8% |
| t | 81161 | 7.2% |
| a | 69444 | 6.1% |
| r | 68473 | 6.0% |
| l | 61658 | 5.4% |
| n | 51003 | 4.5% |
| S | 47393 | 4.2% |
| s | 44653 | 3.9% |
| i | 36408 | 3.2% |
| Other values (42) | 400438 |
| Value | Count | Frequency (%) |
| 188328 | ||
| - | 17510 | 7.4% |
| ' | 10220 | 4.3% |
| 3 | 4080 | 1.7% |
| 0 | 3154 | 1.3% |
| 1 | 2665 | 1.1% |
| 2 | 2592 | 1.1% |
| + | 1932 | 0.8% |
| / | 1443 | 0.6% |
| % | 1337 | 0.6% |
| Other values (12) | 3809 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1370498 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 188328 | 13.7% | |
| e | 172983 | 12.6% |
| o | 99814 | 7.3% |
| t | 81161 | 5.9% |
| a | 69444 | 5.1% |
| r | 68473 | 5.0% |
| l | 61658 | 4.5% |
| n | 51003 | 3.7% |
| S | 47393 | 3.5% |
| s | 44653 | 3.3% |
| Other values (64) | 485588 |
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Apparel | |
|---|---|
| Nest-USA | |
| Office | |
| Drinkware | |
| Lifestyle | |
| Other values (15) |
Length
| Max length | 20 |
|---|---|
| Median length | 7 |
| Mean length | 7.383727611 |
| Min length | 3 |
Characters and Unicode
| Total characters | 295578 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nest-USA |
|---|---|
| 2nd row | Nest-USA |
| 3rd row | Nest-USA |
| 4th row | Nest-USA |
| 5th row | Nest-USA |
| Value | Count | Frequency (%) |
| Apparel | 13696 | |
| Nest-USA | 10708 | |
| Office | 4872 | 12.2% |
| Drinkware | 2614 | 6.5% |
| Lifestyle | 2377 | 5.9% |
| Nest | 1621 | 4.0% |
| Bags | 1400 | 3.5% |
| Headgear | 581 | 1.5% |
| Notebooks & Journals | 568 | 1.4% |
| Waze | 422 | 1.1% |
| Other values (10) | 1172 | 2.9% |
| Value | Count | Frequency (%) |
| apparel | 13696 | |
| nest-usa | 10708 | |
| office | 4872 | 11.8% |
| drinkware | 2614 | 6.3% |
| lifestyle | 2377 | 5.8% |
| nest | 1621 | 3.9% |
| bags | 1435 | 3.5% |
| headgear | 581 | 1.4% |
| 568 | 1.4% | |
| journals | 568 | 1.4% |
| Other values (13) | 2286 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 41489 | |
| p | 27464 | 9.3% |
| A | 24601 | 8.3% |
| a | 20982 | 7.1% |
| r | 20517 | 6.9% |
| s | 18595 | 6.3% |
| l | 16931 | 5.7% |
| t | 16063 | 5.4% |
| N | 13140 | 4.4% |
| f | 12245 | 4.1% |
| Other values (28) | 83551 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 209639 | |
| Uppercase Letter | 73125 | 24.7% |
| Dash Punctuation | 10951 | 3.7% |
| Space Separator | 1295 | 0.4% |
| Other Punctuation | 568 | 0.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 41489 | |
| p | 27464 | |
| a | 20982 | |
| r | 20517 | |
| s | 18595 | |
| l | 16931 | |
| t | 16063 | 7.7% |
| f | 12245 | 5.8% |
| i | 10184 | 4.9% |
| c | 5344 | 2.5% |
| Other values (10) | 19825 |
| Value | Count | Frequency (%) |
| A | 24601 | |
| N | 13140 | |
| U | 10708 | |
| S | 10708 | |
| O | 4872 | 6.7% |
| D | 2614 | 3.6% |
| L | 2377 | 3.3% |
| B | 1718 | 2.3% |
| H | 669 | 0.9% |
| J | 568 | 0.8% |
| Other values (5) | 1150 | 1.6% |
| Value | Count | Frequency (%) |
| - | 10951 |
| Value | Count | Frequency (%) |
| 1295 |
| Value | Count | Frequency (%) |
| & | 568 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 282764 | |
| Common | 12814 | 4.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 41489 | |
| p | 27464 | 9.7% |
| A | 24601 | 8.7% |
| a | 20982 | 7.4% |
| r | 20517 | 7.3% |
| s | 18595 | 6.6% |
| l | 16931 | 6.0% |
| t | 16063 | 5.7% |
| N | 13140 | 4.6% |
| f | 12245 | 4.3% |
| Other values (25) | 70737 |
| Value | Count | Frequency (%) |
| - | 10951 | |
| 1295 | 10.1% | |
| & | 568 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 295578 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 41489 | |
| p | 27464 | 9.3% |
| A | 24601 | 8.3% |
| a | 20982 | 7.1% |
| r | 20517 | 6.9% |
| s | 18595 | 6.3% |
| l | 16931 | 5.7% |
| t | 16063 | 5.4% |
| N | 13140 | 4.4% |
| f | 12245 | 4.1% |
| Other values (28) | 83551 |
| Distinct | 127 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.415378082 |
|---|---|
| Minimum | 1 |
| Maximum | 900 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 15 |
| Maximum | 900 |
| Range | 899 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 20.26703266 |
|---|---|
| Coefficient of variation (CV) | 4.590101297 |
| Kurtosis | 584.0241908 |
| Mean | 4.415378082 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.23009436 |
| Sum | 176752 |
| Variance | 410.7526129 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 26781 | |
| 2 | 5333 | 13.3% |
| 3 | 1756 | 4.4% |
| 5 | 1283 | 3.2% |
| 4 | 929 | 2.3% |
| 10 | 779 | 1.9% |
| 20 | 426 | 1.1% |
| 6 | 317 | 0.8% |
| 15 | 279 | 0.7% |
| 25 | 221 | 0.6% |
| Other values (117) | 1927 | 4.8% |
| Value | Count | Frequency (%) |
| 1 | 26781 | |
| 2 | 5333 | 13.3% |
| 3 | 1756 | 4.4% |
| 4 | 929 | 2.3% |
| 5 | 1283 | 3.2% |
| Value | Count | Frequency (%) |
| 900 | 1 | < 0.1% |
| 825 | 2 | |
| 791 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 600 | 3 |
Avg_Price
Real number (ℝ≥0)
| Distinct | 506 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.41615373 |
|---|---|
| Minimum | 0.39 |
| Maximum | 355.74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 0.39 |
|---|---|
| 5-th percentile | 1.99 |
| Q1 | 5.7 |
| median | 16.99 |
| Q3 | 119 |
| 95-th percentile | 151.88 |
| Maximum | 355.74 |
| Range | 355.35 |
| Interquartile range (IQR) | 113.3 |
Descriptive statistics
| Standard deviation | 63.92781142 |
|---|---|
| Coefficient of variation (CV) | 1.21962042 |
| Kurtosis | 3.306880462 |
| Mean | 52.41615373 |
| Median Absolute Deviation (MAD) | 14.19 |
| Skewness | 1.62114731 |
| Sum | 2098271.05 |
| Variance | 4086.765073 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 119 | 4041 | 10.1% |
| 149 | 2982 | 7.4% |
| 79 | 1509 | 3.8% |
| 13.59 | 1160 | 2.9% |
| 2.99 | 970 | 2.4% |
| 2.39 | 955 | 2.4% |
| 16.99 | 843 | 2.1% |
| 15.19 | 771 | 1.9% |
| 1.99 | 745 | 1.9% |
| 3.99 | 736 | 1.8% |
| Other values (496) | 25319 |
| Value | Count | Frequency (%) |
| 0.39 | 1 | < 0.1% |
| 0.4 | 35 | |
| 0.41 | 11 | < 0.1% |
| 0.5 | 26 | |
| 0.51 | 20 |
| Value | Count | Frequency (%) |
| 355.74 | 129 | |
| 349 | 245 | |
| 279 | 109 | |
| 274.19 | 1 | < 0.1% |
| 269 | 1 | < 0.1% |
Delivery_Charges
Real number (ℝ≥0)
| Distinct | 241 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.44624541 |
|---|---|
| Minimum | 0 |
| Maximum | 521.36 |
| Zeros | 127 |
| Zeros (%) | 0.3% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 6 |
| median | 6 |
| Q3 | 6.5 |
| 95-th percentile | 26.43 |
| Maximum | 521.36 |
| Range | 521.36 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 18.43257371 |
|---|---|
| Coefficient of variation (CV) | 1.764516627 |
| Kurtosis | 220.1261797 |
| Mean | 10.44624541 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.89477448 |
| Sum | 418173.65 |
| Variance | 339.7597736 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 20363 | |
| 6.5 | 11923 | |
| 12.99 | 1949 | 4.9% |
| 19.99 | 747 | 1.9% |
| 12.48 | 555 | 1.4% |
| 12.91 | 328 | 0.8% |
| 8.7 | 250 | 0.6% |
| 0 | 127 | 0.3% |
| 18.47 | 111 | 0.3% |
| 75 | 87 | 0.2% |
| Other values (231) | 3591 | 9.0% |
| Value | Count | Frequency (%) |
| 0 | 127 | 0.3% |
| 6 | 20363 | |
| 6.46 | 9 | < 0.1% |
| 6.48 | 21 | 0.1% |
| 6.5 | 11923 |
| Value | Count | Frequency (%) |
| 521.36 | 1 | < 0.1% |
| 492.84 | 10 | |
| 422.24 | 4 | < 0.1% |
| 354 | 3 | < 0.1% |
| 323.47 | 4 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Clicked | |
|---|---|
| Used | |
| Not Used |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.135270166 |
| Min length | 4 |
Characters and Unicode
| Total characters | 245601 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Used |
|---|---|
| 2nd row | Used |
| 3rd row | Not Used |
| 4th row | Clicked |
| 5th row | Clicked |
| Value | Count | Frequency (%) |
| Clicked | 20359 | |
| Used | 13572 | |
| Not Used | 6100 | 15.2% |
| Value | Count | Frequency (%) |
| clicked | 20359 | |
| used | 19672 | |
| not | 6100 | 13.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 40031 | |
| d | 40031 | |
| C | 20359 | |
| l | 20359 | |
| i | 20359 | |
| c | 20359 | |
| k | 20359 | |
| U | 19672 | |
| s | 19672 | |
| N | 6100 | 2.5% |
| Other values (3) | 18300 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 193370 | |
| Uppercase Letter | 46131 | 18.8% |
| Space Separator | 6100 | 2.5% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 40031 | |
| d | 40031 | |
| l | 20359 | |
| i | 20359 | |
| c | 20359 | |
| k | 20359 | |
| s | 19672 | |
| o | 6100 | 3.2% |
| t | 6100 | 3.2% |
| Value | Count | Frequency (%) |
| C | 20359 | |
| U | 19672 | |
| N | 6100 | 13.2% |
| Value | Count | Frequency (%) |
| 6100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 239501 | |
| Common | 6100 | 2.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 40031 | |
| d | 40031 | |
| C | 20359 | |
| l | 20359 | |
| i | 20359 | |
| c | 20359 | |
| k | 20359 | |
| U | 19672 | |
| s | 19672 | |
| N | 6100 | 2.5% |
| Other values (2) | 12200 | 5.1% |
| Value | Count | Frequency (%) |
| 6100 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 245601 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 40031 | |
| d | 40031 | |
| C | 20359 | |
| l | 20359 | |
| i | 20359 | |
| c | 20359 | |
| k | 20359 | |
| U | 19672 | |
| s | 19672 | |
| N | 6100 | 2.5% |
| Other values (3) | 18300 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| F | |
|---|---|
| M |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40031 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
| Value | Count | Frequency (%) |
| F | 24966 | |
| M | 15065 |
| Value | Count | Frequency (%) |
| f | 24966 | |
| m | 15065 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 24966 | |
| M | 15065 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 40031 |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 24966 | |
| M | 15065 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40031 |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 24966 | |
| M | 15065 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40031 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 24966 | |
| M | 15065 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Chicago | |
|---|---|
| California | |
| New York | |
| New Jersey | |
| Washington DC |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.645000125 |
| Min length | 7 |
Characters and Unicode
| Total characters | 346068 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chicago |
|---|---|
| 2nd row | Chicago |
| 3rd row | Chicago |
| 4th row | Chicago |
| 5th row | Chicago |
| Value | Count | Frequency (%) |
| Chicago | 14584 | |
| California | 11805 | |
| New York | 8374 | |
| New Jersey | 3182 | 7.9% |
| Washington DC | 2086 | 5.2% |
| Value | Count | Frequency (%) |
| chicago | 14584 | |
| california | 11805 | |
| new | 11556 | |
| york | 8374 | |
| jersey | 3182 | 5.9% |
| dc | 2086 | 3.9% |
| washington | 2086 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 40280 | |
| a | 40280 | |
| o | 36849 | 10.6% |
| C | 28475 | 8.2% |
| r | 23361 | 6.8% |
| e | 17920 | 5.2% |
| h | 16670 | 4.8% |
| g | 16670 | 4.8% |
| n | 15977 | 4.6% |
| c | 14584 | 4.2% |
| Other values (13) | 95002 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 276667 | |
| Uppercase Letter | 55759 | 16.1% |
| Space Separator | 13642 | 3.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 40280 | |
| a | 40280 | |
| o | 36849 | |
| r | 23361 | |
| e | 17920 | |
| h | 16670 | 6.0% |
| g | 16670 | 6.0% |
| n | 15977 | 5.8% |
| c | 14584 | 5.3% |
| l | 11805 | 4.3% |
| Other values (6) | 42271 |
| Value | Count | Frequency (%) |
| C | 28475 | |
| N | 11556 | |
| Y | 8374 | 15.0% |
| J | 3182 | 5.7% |
| W | 2086 | 3.7% |
| D | 2086 | 3.7% |
| Value | Count | Frequency (%) |
| 13642 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 332426 | |
| Common | 13642 | 3.9% |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 40280 | |
| a | 40280 | |
| o | 36849 | |
| C | 28475 | 8.6% |
| r | 23361 | 7.0% |
| e | 17920 | 5.4% |
| h | 16670 | 5.0% |
| g | 16670 | 5.0% |
| n | 15977 | 4.8% |
| c | 14584 | 4.4% |
| Other values (12) | 81360 |
| Value | Count | Frequency (%) |
| 13642 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 346068 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 40280 | |
| a | 40280 | |
| o | 36849 | 10.6% |
| C | 28475 | 8.2% |
| r | 23361 | 6.8% |
| e | 17920 | 5.2% |
| h | 16670 | 4.8% |
| g | 16670 | 4.8% |
| n | 15977 | 4.6% |
| c | 14584 | 4.2% |
| Other values (13) | 95002 |
Tenure_Months
Real number (ℝ≥0)
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.9437436 |
|---|---|
| Minimum | 2 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 14 |
| median | 27 |
| Q3 | 37 |
| 95-th percentile | 46 |
| Maximum | 50 |
| Range | 48 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 13.33459876 |
|---|---|
| Coefficient of variation (CV) | 0.5139812884 |
| Kurtosis | -1.095924454 |
| Mean | 25.9437436 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.0922319672 |
| Sum | 1038554 |
| Variance | 177.8115241 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 1653 | 4.1% |
| 25 | 1632 | 4.1% |
| 30 | 1493 | 3.7% |
| 34 | 1434 | 3.6% |
| 5 | 1421 | 3.5% |
| 33 | 1410 | 3.5% |
| 21 | 1263 | 3.2% |
| 28 | 1242 | 3.1% |
| 45 | 1176 | 2.9% |
| 10 | 1146 | 2.9% |
| Other values (39) | 26161 |
| Value | Count | Frequency (%) |
| 2 | 467 | 1.2% |
| 3 | 561 | 1.4% |
| 4 | 706 | |
| 5 | 1421 | |
| 6 | 1034 |
| Value | Count | Frequency (%) |
| 50 | 443 | |
| 49 | 412 | |
| 48 | 633 | |
| 47 | 349 | |
| 46 | 610 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| 0.18 | |
|---|---|
| 0.1 | |
| 0.05 | |
| 0.12 | 88 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.59603807 |
| Min length | 3 |
Characters and Unicode
| Total characters | 143953 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.1 |
|---|---|
| 2nd row | 0.1 |
| 3rd row | 0.1 |
| 4th row | 0.1 |
| 5th row | 0.1 |
| Value | Count | Frequency (%) |
| 0.18 | 20667 | |
| 0.1 | 16171 | |
| 0.05 | 3105 | 7.8% |
| 0.12 | 88 | 0.2% |
| Value | Count | Frequency (%) |
| 0.18 | 20667 | |
| 0.1 | 16171 | |
| 0.05 | 3105 | 7.8% |
| 0.12 | 88 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 43136 | |
| . | 40031 | |
| 1 | 36926 | |
| 8 | 20667 | |
| 5 | 3105 | 2.2% |
| 2 | 88 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 103922 | |
| Other Punctuation | 40031 | 27.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 43136 | |
| 1 | 36926 | |
| 8 | 20667 | |
| 5 | 3105 | 3.0% |
| 2 | 88 | 0.1% |
| Value | Count | Frequency (%) |
| . | 40031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 143953 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 43136 | |
| . | 40031 | |
| 1 | 36926 | |
| 8 | 20667 | |
| 5 | 3105 | 2.2% |
| 2 | 88 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143953 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 43136 | |
| . | 40031 | |
| 1 | 36926 | |
| 8 | 20667 | |
| 5 | 3105 | 2.2% |
| 2 | 88 | 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Existing | |
|---|---|
| New |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 5.609352752 |
| Min length | 3 |
Characters and Unicode
| Total characters | 224548 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New |
|---|---|
| 2nd row | New |
| 3rd row | New |
| 4th row | New |
| 5th row | New |
| Value | Count | Frequency (%) |
| Existing | 20891 | |
| New | 19140 |
| Value | Count | Frequency (%) |
| existing | 20891 | |
| new | 19140 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 41782 | |
| E | 20891 | |
| x | 20891 | |
| s | 20891 | |
| t | 20891 | |
| n | 20891 | |
| g | 20891 | |
| N | 19140 | |
| e | 19140 | |
| w | 19140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 184517 | |
| Uppercase Letter | 40031 | 17.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 41782 | |
| x | 20891 | |
| s | 20891 | |
| t | 20891 | |
| n | 20891 | |
| g | 20891 | |
| e | 19140 | |
| w | 19140 |
| Value | Count | Frequency (%) |
| E | 20891 | |
| N | 19140 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 224548 |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 41782 | |
| E | 20891 | |
| x | 20891 | |
| s | 20891 | |
| t | 20891 | |
| n | 20891 | |
| g | 20891 | |
| N | 19140 | |
| e | 19140 | |
| w | 19140 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 224548 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 41782 | |
| E | 20891 | |
| x | 20891 | |
| s | 20891 | |
| t | 20891 | |
| n | 20891 | |
| g | 20891 | |
| N | 19140 | |
| e | 19140 | |
| w | 19140 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| 2019 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 160124 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2019 |
| 3rd row | 2019 |
| 4th row | 2019 |
| 5th row | 2019 |
| Value | Count | Frequency (%) |
| 2019 | 40031 |
| Value | Count | Frequency (%) |
| 2019 | 40031 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 40031 | |
| 0 | 40031 | |
| 1 | 40031 | |
| 9 | 40031 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 160124 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 40031 | |
| 0 | 40031 | |
| 1 | 40031 | |
| 9 | 40031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 160124 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 40031 | |
| 0 | 40031 | |
| 1 | 40031 | |
| 9 | 40031 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 160124 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 40031 | |
| 0 | 40031 | |
| 1 | 40031 | |
| 9 | 40031 |
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.83402863 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 15 |
| median | 28 |
| Q3 | 38 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 14.32169504 |
|---|---|
| Coefficient of variation (CV) | 0.5337139362 |
| Kurtosis | -1.077566579 |
| Mean | 26.83402863 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.03034840189 |
| Sum | 1074193 |
| Variance | 205.1109489 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 29 | 1234 | 3.1% |
| 31 | 1192 | 3.0% |
| 28 | 1076 | 2.7% |
| 34 | 1046 | 2.6% |
| 33 | 1039 | 2.6% |
| 35 | 1027 | 2.6% |
| 30 | 1026 | 2.6% |
| 32 | 997 | 2.5% |
| 49 | 961 | 2.4% |
| 50 | 930 | 2.3% |
| Other values (42) | 29503 |
| Value | Count | Frequency (%) |
| 1 | 631 | |
| 2 | 585 | |
| 3 | 637 | |
| 4 | 632 | |
| 5 | 568 |
| Value | Count | Frequency (%) |
| 52 | 428 | |
| 51 | 772 | |
| 50 | 930 | |
| 49 | 961 | |
| 48 | 670 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.576078539 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.280530431 |
|---|---|
| Coefficient of variation (CV) | 0.4988581587 |
| Kurtosis | -1.06641822 |
| Mean | 6.576078539 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.0295538037 |
| Sum | 263247 |
| Variance | 10.76187991 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 4654 | |
| 7 | 4567 | |
| 5 | 3467 | |
| 3 | 3394 | |
| 4 | 3264 | |
| 12 | 3182 | |
| 10 | 3116 | |
| 6 | 3081 | |
| 9 | 2991 | |
| 2 | 2819 | |
| Other values (2) | 5496 |
| Value | Count | Frequency (%) |
| 1 | 2737 | |
| 2 | 2819 | |
| 3 | 3394 | |
| 4 | 3264 | |
| 5 | 3467 |
| Value | Count | Frequency (%) |
| 12 | 3182 | |
| 11 | 2759 | |
| 10 | 3116 | |
| 9 | 2991 | |
| 8 | 4654 |
| Distinct | 46 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| SALE20 | |
|---|---|
| SALE10 | |
| SALE30 | |
| ELEC10 | |
| ELEC20 | |
| Other values (41) |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.866378557 |
| Min length | 4 |
Characters and Unicode
| Total characters | 234837 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ELEC10 |
|---|---|
| 2nd row | ELEC10 |
| 3rd row | ELEC10 |
| 4th row | ELEC10 |
| 5th row | ELEC10 |
| Value | Count | Frequency (%) |
| SALE20 | 4828 | |
| SALE10 | 4513 | |
| SALE30 | 4355 | |
| ELEC10 | 3751 | |
| ELEC20 | 3511 | |
| ELEC30 | 3446 | 8.6% |
| EXTRA10 | 1814 | 4.5% |
| OFF10 | 1746 | 4.4% |
| EXTRA20 | 1725 | 4.3% |
| OFF20 | 1658 | 4.1% |
| Other values (36) | 8684 |
| Value | Count | Frequency (%) |
| sale20 | 4828 | |
| sale10 | 4513 | |
| sale30 | 4355 | |
| elec10 | 3751 | |
| elec20 | 3511 | 8.7% |
| elec30 | 3446 | 8.5% |
| extra10 | 1814 | 4.5% |
| off10 | 1746 | 4.3% |
| extra20 | 1725 | 4.3% |
| off20 | 1658 | 4.1% |
| Other values (37) | 8993 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 42727 | |
| 0 | 39722 | |
| L | 24404 | |
| A | 21108 | |
| S | 13696 | 5.8% |
| 2 | 13596 | 5.8% |
| 1 | 13559 | 5.8% |
| 3 | 12567 | 5.4% |
| C | 11712 | 5.0% |
| F | 9744 | 4.1% |
| Other values (20) | 32002 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 153230 | |
| Decimal Number | 79444 | |
| Lowercase Letter | 1854 | 0.8% |
| Space Separator | 309 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 42727 | |
| L | 24404 | |
| A | 21108 | |
| S | 13696 | 8.9% |
| C | 11712 | 7.6% |
| F | 9744 | 6.4% |
| O | 6360 | 4.2% |
| R | 5572 | 3.6% |
| T | 5202 | 3.4% |
| X | 4991 | 3.3% |
| Other values (11) | 7714 | 5.0% |
| Value | Count | Frequency (%) |
| 0 | 39722 | |
| 2 | 13596 | 17.1% |
| 1 | 13559 | 17.1% |
| 3 | 12567 | 15.8% |
| Value | Count | Frequency (%) |
| o | 927 | |
| u | 309 | 16.7% |
| p | 309 | 16.7% |
| n | 309 | 16.7% |
| Value | Count | Frequency (%) |
| 309 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 155084 | |
| Common | 79753 |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 42727 | |
| L | 24404 | |
| A | 21108 | |
| S | 13696 | 8.8% |
| C | 11712 | 7.6% |
| F | 9744 | 6.3% |
| O | 6360 | 4.1% |
| R | 5572 | 3.6% |
| T | 5202 | 3.4% |
| X | 4991 | 3.2% |
| Other values (15) | 9568 | 6.2% |
| Value | Count | Frequency (%) |
| 0 | 39722 | |
| 2 | 13596 | 17.0% |
| 1 | 13559 | 17.0% |
| 3 | 12567 | 15.8% |
| 309 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 234837 |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 42727 | |
| 0 | 39722 | |
| L | 24404 | |
| A | 21108 | |
| S | 13696 | 5.8% |
| 2 | 13596 | 5.8% |
| 1 | 13559 | 5.8% |
| 3 | 12567 | 5.4% |
| C | 11712 | 5.0% |
| F | 9744 | 4.1% |
| Other values (20) | 32002 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| 20.0 | |
|---|---|
| 10.0 | |
| 30.0 | |
| 0.0 | 309 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.992280982 |
| Min length | 3 |
Characters and Unicode
| Total characters | 159815 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10.0 |
|---|---|
| 2nd row | 10.0 |
| 3rd row | 10.0 |
| 4th row | 10.0 |
| 5th row | 10.0 |
| Value | Count | Frequency (%) |
| 20.0 | 13596 | |
| 10.0 | 13559 | |
| 30.0 | 12567 | |
| 0.0 | 309 | 0.8% |
| Value | Count | Frequency (%) |
| 20.0 | 13596 | |
| 10.0 | 13559 | |
| 30.0 | 12567 | |
| 0.0 | 309 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 80062 | |
| . | 40031 | |
| 2 | 13596 | 8.5% |
| 1 | 13559 | 8.5% |
| 3 | 12567 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 119784 | |
| Other Punctuation | 40031 | 25.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 80062 | |
| 2 | 13596 | 11.4% |
| 1 | 13559 | 11.3% |
| 3 | 12567 | 10.5% |
| Value | Count | Frequency (%) |
| . | 40031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 159815 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 80062 | |
| . | 40031 | |
| 2 | 13596 | 8.5% |
| 1 | 13559 | 8.5% |
| 3 | 12567 | 7.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 159815 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 80062 | |
| . | 40031 | |
| 2 | 13596 | 8.5% |
| 1 | 13559 | 8.5% |
| 3 | 12567 | 7.9% |
Marketing_Date
Date
| Distinct | 365 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Minimum | 2019-01-01 00:00:00 |
|---|---|
| Maximum | 2019-12-31 00:00:00 |
Offline_Spend
Real number (ℝ≥0)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2812.093128 |
|---|---|
| Minimum | 500 |
| Maximum | 5000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 1000 |
| Q1 | 2500 |
| median | 3000 |
| Q3 | 3500 |
| 95-th percentile | 4000 |
| Maximum | 5000 |
| Range | 4500 |
| Interquartile range (IQR) | 1000 |
Descriptive statistics
| Standard deviation | 927.8449336 |
|---|---|
| Coefficient of variation (CV) | 0.329948153 |
| Kurtosis | 0.09180464913 |
| Mean | 2812.093128 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | -0.3114640595 |
| Sum | 112570900 |
| Variance | 860896.2208 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3000 | 10027 | |
| 2500 | 7699 | |
| 3500 | 6955 | |
| 2000 | 5038 | |
| 4000 | 3680 | 9.2% |
| 1500 | 1882 | 4.7% |
| 1000 | 1377 | 3.4% |
| 4500 | 1354 | 3.4% |
| 500 | 826 | 2.1% |
| 700 | 597 | 1.5% |
| Value | Count | Frequency (%) |
| 500 | 826 | 2.1% |
| 700 | 597 | 1.5% |
| 1000 | 1377 | 3.4% |
| 1500 | 1882 | 4.7% |
| 2000 | 5038 |
| Value | Count | Frequency (%) |
| 5000 | 596 | 1.5% |
| 4500 | 1354 | 3.4% |
| 4000 | 3680 | 9.2% |
| 3500 | 6955 | |
| 3000 | 10027 |
Online_Spend
Real number (ℝ≥0)
| Distinct | 365 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1873.168574 |
|---|---|
| Minimum | 320.25 |
| Maximum | 4556.93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 320.25 |
|---|---|
| 5-th percentile | 687.86 |
| Q1 | 1196.03 |
| median | 1801.66 |
| Q3 | 2424.5 |
| 95-th percentile | 3396.14 |
| Maximum | 4556.93 |
| Range | 4236.68 |
| Interquartile range (IQR) | 1228.47 |
Descriptive statistics
| Standard deviation | 812.5039497 |
|---|---|
| Coefficient of variation (CV) | 0.4337591187 |
| Kurtosis | -0.2152680107 |
| Mean | 1873.168574 |
| Median Absolute Deviation (MAD) | 609.73 |
| Skewness | 0.467008992 |
| Sum | 74984811.18 |
| Variance | 660162.6683 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1692.8 | 294 | 0.7% |
| 2819.58 | 276 | 0.7% |
| 985.28 | 260 | 0.6% |
| 1331.1 | 258 | 0.6% |
| 1108.88 | 241 | 0.6% |
| 1542.81 | 224 | 0.6% |
| 1946.56 | 218 | 0.5% |
| 1128.09 | 218 | 0.5% |
| 2489.36 | 216 | 0.5% |
| 1901.56 | 215 | 0.5% |
| Other values (355) | 37611 |
| Value | Count | Frequency (%) |
| 320.25 | 130 | |
| 417.73 | 132 | |
| 465.4 | 43 | 0.1% |
| 478.27 | 131 | |
| 484.9 | 113 |
| Value | Count | Frequency (%) |
| 4556.93 | 83 | |
| 4349.02 | 74 | |
| 4055.3 | 110 | |
| 4019.93 | 61 | |
| 3897.2 | 90 |
| Distinct | 8077 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 192.2484246 |
|---|---|
| Minimum | 4.12 |
| Maximum | 115686 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 4.12 |
|---|---|
| 5-th percentile | 8.8 |
| Q1 | 19.59 |
| median | 51.54 |
| Q3 | 155 |
| 95-th percentile | 427.75 |
| Maximum | 115686 |
| Range | 115681.88 |
| Interquartile range (IQR) | 135.41 |
Descriptive statistics
| Standard deviation | 1450.441573 |
|---|---|
| Coefficient of variation (CV) | 7.544621372 |
| Kurtosis | 2484.930398 |
| Mean | 192.2484246 |
| Median Absolute Deviation (MAD) | 40.66 |
| Skewness | 42.29610585 |
| Sum | 7695896.685 |
| Variance | 2103780.756 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 125 | 1171 | 2.9% |
| 155 | 971 | 2.4% |
| 125.5 | 508 | 1.3% |
| 250 | 411 | 1.0% |
| 19.59 | 397 | 1.0% |
| 155.5 | 388 | 1.0% |
| 85 | 279 | 0.7% |
| 16.63 | 266 | 0.7% |
| 21.99 | 255 | 0.6% |
| 22.99 | 235 | 0.6% |
| Other values (8067) | 35150 |
| Value | Count | Frequency (%) |
| 4.12 | 1 | < 0.1% |
| 4.185 | 1 | < 0.1% |
| 4.557 | 6 | |
| 4.65 | 1 | < 0.1% |
| 4.753 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 115686 | 1 | |
| 109452.83 | 1 | |
| 78582.08 | 1 | |
| 65250 | 1 | |
| 60154.884 | 1 |
revenue_per_customer
Real number (ℝ≥0)
| Distinct | 734 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27753.46303 |
|---|---|
| Minimum | 28.863 |
| Maximum | 257792.206 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 28.863 |
|---|---|
| 5-th percentile | 1853.213 |
| Q1 | 5094.736 |
| median | 10783.312 |
| Q3 | 24715.743 |
| 95-th percentile | 118511.709 |
| Maximum | 257792.206 |
| Range | 257763.343 |
| Interquartile range (IQR) | 19621.007 |
Descriptive statistics
| Standard deviation | 46116.52219 |
|---|---|
| Coefficient of variation (CV) | 1.661649292 |
| Kurtosis | 10.84155146 |
| Mean | 27753.46303 |
| Median Absolute Deviation (MAD) | 6815.056 |
| Skewness | 3.170178368 |
| Sum | 1110998879 |
| Variance | 2126733619 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 244509.716 | 695 | 1.7% |
| 106406.408 | 587 | 1.5% |
| 118511.709 | 575 | 1.4% |
| 113573.419 | 572 | 1.4% |
| 134516.474 | 523 | 1.3% |
| 36400.311 | 366 | 0.9% |
| 53675.804 | 315 | 0.8% |
| 60381.669 | 297 | 0.7% |
| 25601.119 | 290 | 0.7% |
| 36940.336 | 261 | 0.7% |
| Other values (724) | 35550 |
| Value | Count | Frequency (%) |
| 28.863 | 2 | |
| 62.22 | 2 | |
| 63.296 | 4 | |
| 71.18 | 2 | |
| 82.192 | 2 |
| Value | Count | Frequency (%) |
| 257792.206 | 147 | 0.4% |
| 245611.693 | 157 | 0.4% |
| 244509.716 | 695 | |
| 137971.419 | 163 | 0.4% |
| 134516.474 | 523 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| High_value | |
|---|---|
| Medium_value | |
| Low_value |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 10.45304889 |
| Min length | 9 |
Characters and Unicode
| Total characters | 418446 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | High_value |
|---|---|
| 2nd row | High_value |
| 3rd row | High_value |
| 4th row | High_value |
| 5th row | High_value |
| Value | Count | Frequency (%) |
| High_value | 16470 | |
| Medium_value | 13899 | |
| Low_value | 9662 |
| Value | Count | Frequency (%) |
| high_value | 16470 | |
| medium_value | 13899 | |
| low_value | 9662 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 53930 | |
| e | 53930 | |
| _ | 40031 | |
| v | 40031 | |
| a | 40031 | |
| l | 40031 | |
| i | 30369 | |
| H | 16470 | 3.9% |
| g | 16470 | 3.9% |
| h | 16470 | 3.9% |
| Other values (6) | 70683 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 338384 | |
| Uppercase Letter | 40031 | 9.6% |
| Connector Punctuation | 40031 | 9.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| u | 53930 | |
| e | 53930 | |
| v | 40031 | |
| a | 40031 | |
| l | 40031 | |
| i | 30369 | |
| g | 16470 | 4.9% |
| h | 16470 | 4.9% |
| d | 13899 | 4.1% |
| m | 13899 | 4.1% |
| Other values (2) | 19324 | 5.7% |
| Value | Count | Frequency (%) |
| H | 16470 | |
| M | 13899 | |
| L | 9662 |
| Value | Count | Frequency (%) |
| _ | 40031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 378415 | |
| Common | 40031 | 9.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| u | 53930 | |
| e | 53930 | |
| v | 40031 | |
| a | 40031 | |
| l | 40031 | |
| i | 30369 | |
| H | 16470 | 4.4% |
| g | 16470 | 4.4% |
| h | 16470 | 4.4% |
| M | 13899 | 3.7% |
| Other values (5) | 56784 |
| Value | Count | Frequency (%) |
| _ | 40031 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 418446 |
Most frequent character per block
| Value | Count | Frequency (%) |
| u | 53930 | |
| e | 53930 | |
| _ | 40031 | |
| v | 40031 | |
| a | 40031 | |
| l | 40031 | |
| i | 30369 | |
| H | 16470 | 3.9% |
| g | 16470 | 3.9% |
| h | 16470 | 3.9% |
| Other values (6) | 70683 |
days_bw_visits
Real number (ℝ≥0)
| Distinct | 382 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.69418114 |
|---|---|
| Minimum | 1 |
| Maximum | 351 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 625.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 15.26086957 |
| median | 43.66666667 |
| Q3 | 74 |
| 95-th percentile | 158 |
| Maximum | 351 |
| Range | 350 |
| Interquartile range (IQR) | 58.73913043 |
Descriptive statistics
| Standard deviation | 52.57700312 |
|---|---|
| Coefficient of variation (CV) | 0.9612906167 |
| Kurtosis | 4.667905908 |
| Mean | 54.69418114 |
| Median Absolute Deviation (MAD) | 28.4057971 |
| Skewness | 1.804998284 |
| Sum | 2189462.765 |
| Variance | 2764.341257 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5490 | 13.7% |
| 7.575757576 | 695 | 1.7% |
| 15.26086957 | 587 | 1.5% |
| 16 | 584 | 1.5% |
| 13.42307692 | 575 | 1.4% |
| 17.73684211 | 572 | 1.4% |
| 13.76 | 523 | 1.3% |
| 40.6 | 376 | 0.9% |
| 43 | 373 | 0.9% |
| 50 | 354 | 0.9% |
| Other values (372) | 29902 |
| Value | Count | Frequency (%) |
| 1 | 5490 | |
| 1.5 | 29 | 0.1% |
| 2 | 133 | 0.3% |
| 2.5 | 45 | 0.1% |
| 2.777777778 | 297 | 0.7% |
| Value | Count | Frequency (%) |
| 351 | 62 | |
| 344 | 34 | |
| 332 | 17 | < 0.1% |
| 325 | 19 | < 0.1% |
| 307 | 17 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 625.5 KiB |
| Less than 30 days visit | |
|---|---|
| More than 60 days visit | |
| More than 30 & Less than 60 days visit |
Length
| Max length | 38 |
|---|---|
| Median length | 23 |
| Mean length | 27.44143289 |
| Min length | 23 |
Characters and Unicode
| Total characters | 1098508 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Less than 30 days visit |
|---|---|
| 2nd row | Less than 30 days visit |
| 3rd row | Less than 30 days visit |
| 4th row | Less than 30 days visit |
| 5th row | Less than 30 days visit |
| Value | Count | Frequency (%) |
| Less than 30 days visit | 14239 | |
| More than 60 days visit | 13939 | |
| More than 30 & Less than 60 days visit | 11853 |
| Value | Count | Frequency (%) |
| than | 51884 | |
| days | 40031 | |
| visit | 40031 | |
| less | 26092 | |
| 30 | 26092 | |
| more | 25792 | |
| 60 | 25792 | |
| 11853 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 207536 | ||
| s | 132246 | |
| t | 91915 | 8.4% |
| a | 91915 | 8.4% |
| i | 80062 | 7.3% |
| e | 51884 | 4.7% |
| h | 51884 | 4.7% |
| n | 51884 | 4.7% |
| 0 | 51884 | 4.7% |
| d | 40031 | 3.6% |
| Other values (9) | 247267 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 723467 | |
| Space Separator | 207536 | 18.9% |
| Decimal Number | 103768 | 9.4% |
| Uppercase Letter | 51884 | 4.7% |
| Other Punctuation | 11853 | 1.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| s | 132246 | |
| t | 91915 | |
| a | 91915 | |
| i | 80062 | |
| e | 51884 | 7.2% |
| h | 51884 | 7.2% |
| n | 51884 | 7.2% |
| d | 40031 | 5.5% |
| y | 40031 | 5.5% |
| v | 40031 | 5.5% |
| Other values (2) | 51584 | 7.1% |
| Value | Count | Frequency (%) |
| 0 | 51884 | |
| 3 | 26092 | |
| 6 | 25792 |
| Value | Count | Frequency (%) |
| L | 26092 | |
| M | 25792 |
| Value | Count | Frequency (%) |
| 207536 |
| Value | Count | Frequency (%) |
| & | 11853 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 775351 | |
| Common | 323157 |
Most frequent character per script
| Value | Count | Frequency (%) |
| s | 132246 | |
| t | 91915 | |
| a | 91915 | |
| i | 80062 | |
| e | 51884 | 6.7% |
| h | 51884 | 6.7% |
| n | 51884 | 6.7% |
| d | 40031 | 5.2% |
| y | 40031 | 5.2% |
| v | 40031 | 5.2% |
| Other values (4) | 103468 |
| Value | Count | Frequency (%) |
| 207536 | ||
| 0 | 51884 | 16.1% |
| 3 | 26092 | 8.1% |
| 6 | 25792 | 8.0% |
| & | 11853 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1098508 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 207536 | ||
| s | 132246 | |
| t | 91915 | 8.4% |
| a | 91915 | 8.4% |
| i | 80062 | 7.3% |
| e | 51884 | 4.7% |
| h | 51884 | 4.7% |
| n | 51884 | 4.7% |
| 0 | 51884 | 4.7% |
| d | 40031 | 3.6% |
| Other values (9) | 247267 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CustomerID | Transaction_ID | Transaction_Date | Product_SKU | Product_Description | Product_Category | Quantity | Avg_Price | Delivery_Charges | Coupon_Status | Gender | Location | Tenure_Months | GST | Transaction_Date_Month_x | Transaction_Date_Month_y | User_type | year_number | week_number | month_number | Coupon_Code | Discount_pct | Marketing_Date | Offline_Spend | Online_Spend | revenue | revenue_per_customer | revenue_seg | days_bw_visits | Visit_days_average | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 17850 | 16679 | 2019-01-01 | GGOENEBJ079499 | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | Nest-USA | 1 | 153.71 | 6.50 | Used | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 144.189 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 1 | 17850 | 16680 | 2019-01-01 | GGOENEBJ079499 | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | Nest-USA | 1 | 153.71 | 6.50 | Used | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 144.189 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 2 | 17850 | 16696 | 2019-01-01 | GGOENEBQ078999 | Nest Cam Outdoor Security Camera - USA | Nest-USA | 2 | 122.77 | 6.50 | Not Used | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 258.540 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 3 | 17850 | 16699 | 2019-01-01 | GGOENEBQ079099 | Nest Protect Smoke + CO White Battery Alarm-USA | Nest-USA | 1 | 81.50 | 6.50 | Clicked | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 88.000 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 4 | 17850 | 16700 | 2019-01-01 | GGOENEBJ079499 | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | Nest-USA | 1 | 153.71 | 6.50 | Clicked | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 160.210 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 5 | 17850 | 16701 | 2019-01-01 | GGOENEBJ079499 | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | Nest-USA | 1 | 153.71 | 6.50 | Clicked | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 160.210 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 6 | 17850 | 16702 | 2019-01-01 | GGOENEBJ079499 | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | Nest-USA | 2 | 153.71 | 6.50 | Clicked | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 320.420 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 7 | 17850 | 16703 | 2019-01-01 | GGOENEBQ079099 | Nest Protect Smoke + CO White Battery Alarm-USA | Nest-USA | 2 | 81.50 | 6.50 | Not Used | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 176.000 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 8 | 17850 | 16704 | 2019-01-01 | GGOENEBJ079499 | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | Nest-USA | 1 | 256.88 | 6.50 | Used | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 237.042 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
| 9 | 17850 | 16710 | 2019-01-01 | GGOENEBJ079499 | Nest Learning Thermostat 3rd Gen-USA - Stainless Steel | Nest-USA | 1 | 153.71 | 28.78 | Clicked | M | Chicago | 12 | 0.1 | 2019-01 | 2019-01 | New | 2019 | 1 | 1 | ELEC10 | 10.0 | 2019-01-01 | 4500 | 2424.5 | 182.490 | 60381.669 | High_value | 2.777778 | Less than 30 days visit |
Last rows
| CustomerID | Transaction_ID | Transaction_Date | Product_SKU | Product_Description | Product_Category | Quantity | Avg_Price | Delivery_Charges | Coupon_Status | Gender | Location | Tenure_Months | GST | Transaction_Date_Month_x | Transaction_Date_Month_y | User_type | year_number | week_number | month_number | Coupon_Code | Discount_pct | Marketing_Date | Offline_Spend | Online_Spend | revenue | revenue_per_customer | revenue_seg | days_bw_visits | Visit_days_average | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 40021 | 16359 | 37354 | 2019-09-04 | GGOEGAAB033814 | Google Men's Vintage Badge Tee Black | Apparel | 1 | 7.60 | 6.5 | Clicked | F | New York | 49 | 0.18 | 2019-09 | 2019-08 | Existing | 2019 | 36 | 9 | SALE30 | 30.0 | 2019-09-04 | 2500 | 1255.01 | 14.10 | 89.152 | Low_value | 13.0 | Less than 30 days visit |
| 40022 | 16359 | 36274 | 2019-08-22 | GGOEGDHQ015399 | 26 oz Double Wall Insulated Bottle | Drinkware | 1 | 19.99 | 6.5 | Clicked | F | New York | 49 | 0.18 | 2019-08 | 2019-08 | New | 2019 | 34 | 8 | EXTRA20 | 20.0 | 2019-08-22 | 2500 | 1172.96 | 26.49 | 89.152 | Low_value | 13.0 | Less than 30 days visit |
| 40023 | 16359 | 36273 | 2019-08-22 | GGOEGHPJ080310 | Google Blackout Cap | Headgear | 1 | 13.29 | 6.0 | Not Used | F | New York | 49 | 0.05 | 2019-08 | 2019-08 | New | 2019 | 34 | 8 | HGEAR20 | 20.0 | 2019-08-22 | 2500 | 1172.96 | 19.29 | 89.152 | Low_value | 13.0 | Less than 30 days visit |
| 40024 | 15171 | 36604 | 2019-08-25 | GGOEGAAJ059116 | Google Men's Short Sleeve Performance Badge Tee Pewter | Apparel | 1 | 8.80 | 6.0 | Used | F | New Jersey | 48 | 0.18 | 2019-08 | 2019-08 | New | 2019 | 34 | 8 | SALE20 | 20.0 | 2019-08-25 | 2500 | 1941.38 | 11.84 | 146.860 | Low_value | 28.0 | Less than 30 days visit |
| 40025 | 15171 | 36604 | 2019-08-25 | GGOEGALB036514 | Google Women's Scoop Neck Tee Black | Apparel | 2 | 4.80 | 6.0 | Clicked | F | New Jersey | 48 | 0.18 | 2019-08 | 2019-08 | New | 2019 | 34 | 8 | SALE20 | 20.0 | 2019-08-25 | 2500 | 1941.38 | 21.60 | 146.860 | Low_value | 28.0 | Less than 30 days visit |
| 40026 | 15171 | 36604 | 2019-08-25 | GGOEGALJ034415 | Google Women's Vintage Hero Tee Platinum | Apparel | 1 | 4.56 | 6.0 | Clicked | F | New Jersey | 48 | 0.18 | 2019-08 | 2019-08 | New | 2019 | 34 | 8 | SALE20 | 20.0 | 2019-08-25 | 2500 | 1941.38 | 10.56 | 146.860 | Low_value | 28.0 | Less than 30 days visit |
| 40027 | 15171 | 36604 | 2019-08-25 | GGOEGALP034315 | Google Women's Vintage Hero Tee Lavender | Apparel | 1 | 7.60 | 6.0 | Not Used | F | New Jersey | 48 | 0.18 | 2019-08 | 2019-08 | New | 2019 | 34 | 8 | SALE20 | 20.0 | 2019-08-25 | 2500 | 1941.38 | 13.60 | 146.860 | Low_value | 28.0 | Less than 30 days visit |
| 40028 | 15171 | 36604 | 2019-08-25 | GGOEGALQ036614 | Google Women's Scoop Neck Tee White | Apparel | 2 | 4.80 | 6.0 | Used | F | New Jersey | 48 | 0.18 | 2019-08 | 2019-08 | New | 2019 | 34 | 8 | SALE20 | 20.0 | 2019-08-25 | 2500 | 1941.38 | 17.28 | 146.860 | Low_value | 28.0 | Less than 30 days visit |
| 40029 | 15171 | 38754 | 2019-09-22 | GGOEGAAJ073413 | Google Men's Short Sleeve Hero Tee Heather | Apparel | 1 | 15.19 | 6.0 | Clicked | F | New Jersey | 48 | 0.18 | 2019-09 | 2019-08 | Existing | 2019 | 38 | 9 | SALE30 | 30.0 | 2019-09-22 | 2500 | 1895.73 | 21.19 | 146.860 | Low_value | 28.0 | Less than 30 days visit |
| 40030 | 15171 | 38755 | 2019-09-22 | GGOEGAFB035815 | Google Men's Zip Hoodie | Apparel | 1 | 44.79 | 6.0 | Clicked | F | New Jersey | 48 | 0.18 | 2019-09 | 2019-08 | Existing | 2019 | 38 | 9 | SALE30 | 30.0 | 2019-09-22 | 2500 | 1895.73 | 50.79 | 146.860 | Low_value | 28.0 | Less than 30 days visit |